Using Adaptive Probing for Real-Time Problem Diagnosis in Distributed Computer Systems
نویسندگان
چکیده
In this work, we focus on cost-efficient techniques for realtime diagnosis in distributed systems that allow an adaptive, on-line selection and execution of appropriate measurements (tests). Particularly, one of our applications concerns fault diagnosis in distributed computer systems and networks by using test transactions, or probes (e.g., ”traceroute” or ”ping” commands). The key efficiency issues include both the cost of probing (e.g., the number of probes), and the computational complexity of diagnosis. In our past work (see (Rish, Brodie, & Ma 2002a)), we derived some theoretical conditions on the number of probes required for an asymptotic error-free diagnosis, and developed efficient search techniques for probe set selection that can greatly reduce the probe set size while maintaining its diagnostic capability (Brodie, Rish, & Ma 2001). Next, we considered the problem of real-time diagnosis as a probabilistic inference in Bayesian networks and investigated simple and efficient local approximation techniques, based on variable-elimination (the minibucket scheme (Dechter & Rish 2002)). Our empirical studies show that these approximations ”degrade gracefully” with noise and often yield an optimal solution when noise is low enough, and our initial theoretical analysis explains this behavior for the simplest (greedy) approximation (Rish, Brodie, & Ma 2002a; 2002b). Our future work will focus on adapting more sophisticated approximation techniques, such as Generalized Belief Propagation (Yedidia, Freeman, & Weiss 2001), to real-time scenarios, and a real-time, incremental learning of Dynamic Bayesian Networks based on the historic data and the feedback on the diagnosis results.
منابع مشابه
Detecting and counting vehicles using adaptive background subtraction and morphological operators in real time systems
vehicle detection and classification of vehicles play an important role in decision making for the purpose of traffic control and management.this paper presents novel approach of automating detecting and counting vehicles for traffic monitoring through the usage of background subtraction and morphological operators. We present adaptive background subtraction that is compatible with weather and ...
متن کاملADAPTIVE FUZZY TRACKING CONTROL FOR A CLASS OF NONLINEAR SYSTEMS WITH UNKNOWN DISTRIBUTED TIME-VARYING DELAYS AND UNKNOWN CONTROL DIRECTIONS
In this paper, an adaptive fuzzy control scheme is proposed for a class of perturbed strict-feedback nonlinear systems with unknown discrete and distributed time-varying delays, and the proposed design method does not require a priori knowledge of the signs of the control gains.Based on the backstepping technique, the adaptive fuzzy controller is constructed. The main contributions of the paper...
متن کاملProblem Diagnosis in Distributed Systems using Active Probing
As distributed systems continue to grow in size and complexity, scalable and cost-effective techniques are needed for performing tasks such as problem determination and fault diagnosis. We address these tasks using probes, or end-to-end test transactions, which gather information about system components (e.g., using IBM’s EPP technology). Effective probing requires minimizing the cost of probin...
متن کاملOptimal Combined and Adaptive Protection of Active Distribution Networks Considering Different System Topologies Incorporating Optimal Selection of Standard Relay Curves
The change in the topology of active distribution networks (ADNs) is one of the essential challenges that might affect the protection schemes. The conventional protection schemes based on base topology result in some coordination constraint violations in other topologies due to the outage of upstream substations and distributed generation units. In this article, new combinational and adaptive p...
متن کاملA New Adaptive Load-Shedding and Restoration Strategy for Autonomous Operation of Microgrids: A Real-Time Study
Islanding operation is one of the main features of a MicroGrid (MG), which is realized regarding the presence of distributed energy resources (DERs). However, in order to deal with the control challenges, which an MG faces during island operation, particularly when the transition is associated with certain excessive load, an efficient control strategy is required. This paper introduces a Centra...
متن کامل